A Very Large Dictionary With Paradigmatic, Syntagmatic, And Paronymic

نویسندگان

  • Igor A. Bolshakov
  • Alexander F. Gelbukh
چکیده

A very large Russian dictionary is described. It contains currently 3.6 million links between its 120,000 entries. The links are syntagmatic (collocations), paradigmatic (WordNet-like), or paronymic (words similar in letters or in morphs). The entries of the dictionary are singleor multiwords belonging to four main POS. The entries represent so-called grammemes rather than lexemes: e.g., nouns are represented as singular and plural; verbs are split into ‘finite forms + infinitive’, ‘participle’, and ‘gerund’. The multiword entries in turn can be collocations—idiomatic free—whose parts are also entries of the same dictionary.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Correspondence of Syntagmatic and Paradigmatic Axes Relations, and Their Transformation in Relation to the Communicative Role of Shahnameh Illustration in Shiraz School of Miniature

When treated like texts with their own visual language, illustrations from the Shiraz School of miniature are a mixture of the syntagmatic and paradigmatic relations of signs. Syntagmatic relations reveal the different ways the elements of a text are connected, while paradigmatic relations identify the sets of signifiers that signify the content of the text, dealing with intratextual and intert...

متن کامل

Global Evaluation of Random Indexing through Swedish Word Clustering Compared to the People's Dictionary of Synonyms

Evaluation of word space models is usually local in the sense that it only considers words that are deemed very similar by the model. We propose a global evaluation scheme based on clustering of the words. A clustering of high quality in an external evaluation against a semantic resource, such as a dictionary of synonyms, indicates a word space model of high quality. We use Random Indexing to c...

متن کامل

Lexical Functions of Explanatory Combinatorial Dictionary for lexicalization in text generation

In this paper, we show that the Lexical Functions (LFs) of the Explanatory Combinatorial Dictionary are well suited for addressing some lexical questions in text generation like collocations and textual phenomena. We first describe LFs and divide them into two subsets: paradigmatic LFs and syntagmatic LFs. In particular, we see how paradigmatic LFs can be used for coreferential relations and de...

متن کامل

Capturing Paradigmatic and Syntagmatic Lexical Relations: Towards Accurate Chinese Part-of-Speech Tagging

From the perspective of structural linguistics, we explore paradigmatic and syntagmatic lexical relations for Chinese POS tagging, an important and challenging task for Chinese language processing. Paradigmatic lexical relations are explicitly captured by word clustering on large-scale unlabeled data and are used to design new features to enhance a discriminative tagger. Syntagmatic lexical rel...

متن کامل

From Machine Readable Dictionaries to Lexicons for NLP: the Cobuild Dictionaries - a Different Approach

We describe the results of a syntactic-semantic parser for Cobuild dictionary definitions. Unlike previous work on the automatic analysis of machine readable dictionaries, the particular structure of the Cobuild definition allows us to derive information that classifies the lexical item mainly in terms of the selectional restrictions or preferences encoded on its arguments. The resulting formal...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004